Picture for Qingyun Wu

Qingyun Wu

When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs

Add code
May 22, 2026
Viaarxiv icon

MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

Add code
May 14, 2026
Viaarxiv icon

On Emotion-Sensitive Decision Making of Small Language Model Agents

Add code
Apr 08, 2026
Viaarxiv icon

MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation

Add code
Mar 24, 2026
Viaarxiv icon

Live-Evo: Online Evolution of Agentic Memory from Continuous Feedback

Add code
Feb 02, 2026
Viaarxiv icon

Do Images Speak Louder than Words? Investigating the Effect of Textual Misinformation in VLMs

Add code
Jan 27, 2026
Viaarxiv icon

Sliding Window Attention Adaptation

Add code
Dec 16, 2025
Viaarxiv icon

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Add code
Jul 28, 2025
Figure 1 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 2 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 3 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 4 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Viaarxiv icon

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Add code
May 07, 2025
Figure 1 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Figure 2 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Figure 3 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Figure 4 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Viaarxiv icon

Divide, Optimize, Merge: Fine-Grained LLM Agent Optimization at Scale

Add code
May 06, 2025
Viaarxiv icon